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© A facility is provided for storing in a backup 
memory (30-1 -P) only those blocks of a file, or 
disk partition, which differ from corresponding blocks 
forming an earlier version of the file. Specifically, a 
file Is divided into a number of blocks and a 
"signature" is generated for each such block. A 
block is then stored in the backup memory only if its 
associated signature differs from a signature gen- 
erated for an earlier version of the block. In addition, 
if two blocks of the current version of the file have 
identical signatures and are to be stored in the 
backup memory, then only one of the two blocks is 
stored in the memory and a simple message in- 
dicating that the other block is equal to the one 
block Is stored in the memory for the other block. 
Further, the application of such signatures is ad- 
vantageously applied to the opposite case of re- 
storing a file using copies of previous versions of the 
file that are stored in the backup memory. 
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Technical Field 

The invention relates to storing a computer file 
or memory partition in a backup memory. 

Background of the Invention 

Conventional computer file backup techniques 
allow what is commonly referred to as an In- 
cremental backup of a file using a time stamp 
associated with the file. As a result of such backup 
techniques, different versions of a file may be 
stored in the memory of a backup media. The 
capacity of the backup media is generally not 
overburdened when the size of a file being stored 
on the media is small. However, when the size of a 
file and each version thereof is very large, or the 
file is a disc partition, then the capacity of the 
backup media may be used up quickly. This 
problem is especially true when the difference be - 
tween two versions of a large file, or disc partition, 
Is not great, since It results in storing in the backup 
media two slightly different versions of the same 
file each of which is very large. 

Summary of the Invention 

The above problems are dealt with in accord 
with the principles of the invention by dividing a 
file, or disc partition. Into blocks, generating a sig - 
nature for each such block, in which the signature 
is Indicative of the values forming the contents of 
the associated block, and then storing In a backup 
memory only those blocks whose signatures differ 
from signatures generated for con'esponding blocks 
of a previous version of the file, or disc partition. In 
the event that a previous version of the file does 
not exit, then all blocks of the current file are stored 
in the backup memory. 

As an aspect of the Invention, if at least two 
blocks have the same signatures and are to be 
stored in the backup memory, then only one of the 
two blocks Is stored in the backup memory and a 
message is stored in the backup In place of the 
other block, in which the message simply indicates 
that the other block is identical to the one block. 

In accord with other aspects of the invention 
described below in detail, such signatures are used 
in the restoration of a file employing earlier version 
of the file that may be stored in the backup. 

Brief Description of the Drawing 

FIG. 1 shows a broad block diagram of a com - 
puter archiving system In which the principles of 
the invention may be practiced; 
FIG. 2 is an illustrative example of a table of 
block signatures generated in accord with an 



aspect of the invention for a file F1 ; 

FIG. 3 is an illustrative example of one possible 

way of storing file F1 In a so-called archive 

memory; 

5 FIG. 4 is an illustrative example of a table of 
block signatures generated in accord with an 
aspect of the invention for a later version of file 
F1; 

FIG. 5 is an illustrative example of one possible 
10 way of storing in the archive memory selected 
blocks of the later version of file F1 ; 
FIG. 6 is an illustrative example of a so-called 
global bit map that is used in accord with an 
aspect of the invention in supplying a backup of 
76 an archived file; 

FIGs. 7 and 8 illustrate in flow chart form a 
program which implements the principles if the 
invention in a client computer, such as the 
computers 10 of FIG. 1; 
20 FIGs. 9, 10 and 11 illustrate in flow chart form a 
program which implements the principles of the 
invention in an archive computer, such as com - 
puter 110 of FIG. 1; and 

FIG. 12 shows how FIGs. 9 and 10 should be 
25 arranged with respect to one another. 

Detailed Description 

Turning now to FIG. 1, archiving system 100 

30 includes computer 110 and hard disc unit 115. The 
software which drives system 100 is stored in disc 
115. Computer 110, which may be, for example, 
the SPARCSTATION 2 commercially available from 
Sun Microsystems, Inc., operates in a conventional 

35 manner to periodically poll individual ones of 
computers 10-1 through 10- N via data network 
20. Data network 20 may be, for example, the 
well-known Ethernet network. Computer 110 in- 
vokes such polling on a scheduled basis (e.g., 

40 daily, weekly, monthly, etc.) and does so for the 
purpose of storing in one of memories 30-1 
through 30 - P the contents of the memory asso - 
dated with the computer that is being polled, e.g., 
computer 10-1. Such contents typically comprise 

45 a plurality of named files composed of data and/or 
programs, and may be on the order of, for exam - 
pie, forty megabytes to several gigabytes of 
memory. In an illustrative embodiment of the in- 
vention, each of the memories 30 - 1 through 30 - 

50 P may be, for example, a so-called rewritable 
optical disc library unit (commonly referred to as a 
"jukebox"). One such "jukebox" is the model 
OL112-22 unit commercially available from 
Hitachi with each such unit having a number of 644 

55 megabyte optical disc drives that are also com - 
mercially available from Hitachi. In the practice of 
the invention, each of the computers 10-1 through 
10 -N may be either a personal computer, 
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minicomputer or a large main frame computer. In 
addition, each of the disc units 11-1 through 11 - 
M may actually be one or more disc units. (Herein 
each of the designations J, M, N and P shown in 
FIG. 1 is a respective integer. In addition, the term 
"file" is taken to mean a program, data, disc parti - 
tion or contents of a memory including any subset 
thereof.) 

Assume that computer 110 is engaged in an 
archiving session with one of the computers 10-1 
through 10- N. e.g.. computer 10-1. The latter 
computer unloads from its associated disc unit a 
file that is to be archived and supplies the file, 
block by block, to computer 110 via network 20 for 
storage in one of the backup memories 30-1 
through 30 -P. e.g., memory 30-1. In an illustra- 
tive embodiment of the invention, a block of a file 
that is to be archived comprises a predetermined 
number of data bytes - - Illustratively 1000 data 
bytes. In addition, each such file is preceded by a 
file header identifying, inter alia, the name of 
computer 10-1, path name of the file currently 
being passed, the date of the last change made to 
the file, as well as other information associated with 
the file. However, a block of a file is passed to 
computer 110 only if the block had not been pre- 
viously archived. That is, computer 10-1, in ac- 
cord with the invention, calculates a signature for 
each block of a file that is to be archived, in which 
a signature is indicative of the values of the bytes 
forming the contents of the associated block. 
Computer 10-1 then supplies to computer 110 
only those blocks of the file having signatures 
which are different from conresponding signatures 
generated during a prior archiving session involving 
the same file. 

Assume that computer 10-1 desires to store 
(archive) on memory 30-1 via computer 110 a 
new file F1. In doing so, computer 10-1 generates 
a signature for each block of data forming file F1 
and stores each such signature in a table that is 
assigned to file F1 and stored in the internal 
memory of computer 10-1. Each such signature is 
stored in the table at a location corresponding with 
the address (e.g., sequence number) of its asso - 
ciated block. 

In implementing the invention, a signature may 
be generated using any one of a number of dif- 
ferent code generation techniques. In an illustrative 
embodiment of the invention, a block signature is 
generated by passing the data forming a respective 
block through a conventional Cyclic - 
Redundancy -Code (CRC) generator, which may 
be implemented in software. Accordingly, if file F1 
comprises N blocks, then the file F1 table would 
contain N CRC entries. 

An example of such a table of signatures is 
shown in FIG. 2, in which Table 200 comprises N 



entries corresponding to the number of blocks for- 
ming file F1. In the FIG., a respective signature is 
represented by a letter designation, e.g., CRC, and 
a numerical designation. Thus, the signatures 

6 generated for blocks 1, 2, and 3 of file F1 are 
respectively represented in Table 200 by the des - 
ignations CRC1 , CRC2 and CRC3. The signatures 
associated with the remaining block forming file F1 
are similarly represented. 

10 Since it is assumed that file F1 is a new file, a 
copy of which had not been previously stored on 
memory 30-1, then computer 10-1 passes to 
computer 110 via network 20 each block forming 
file F1, Computer 110, in turn, stores each such 

75 block as it is received in one of the archive 
memories 30-1 through 30 -P, e.g., memory 30- 
1, 

In accord with an aspect of the invention, 
computer 10-1 does not pass to computer 110 a 

20 block of file F1 that is identical to another block of 
file F1 that has been passed to computer 110 
during the current archiving session. That is, before 
computer 10-1 supplies to computer 110 a cur- 
rent block of file F1, computer 10-1 compares the 

25 signature associated with that block with the sig - 
natures of blocks that have been supplied to com - 
puter 110 during the current archiving session. If 
computer 10-1 finds that such a comparison ex- 
ists, then computer 10-1 does not archive the 

30 associated block. Instead, computer 10-1 supplies 
to computer 110 a message indicating that the 
current block is identical to one that has been 
archived (stored) during the current session. 

For example, assume that blocks 80 ad 81 of 

35 file F1 are identical to block 28. In that case then, 
the signatures generated for blocks 80 and 81 
would be identical to the signature generated for 
block 28. Accordingly, computer 10-1 does not 
supply blocks 80 and 81 to computer 110. Instead, 

40 computer 10-1 supplies to computer 110 mes- 
sages, or flags, respectively identifying the fact that 
blocks 80 and 81 are identical to block 28. Com - 
puter 110, in turn, stores the messages in the 
archive memory 30-1 in the order that the mes- 

45 sages are received via network 20. 

Turning now to FIG. 3, there is shown a ex- 
ample of one way in which computer 110 may 
store in archive memory 30 - 1 the blocks of file F1 
that computer 110 receives from computer 10-1. 

50 It is seen from the FIG. that the stored blocks of 
file F1 are preceded in memory by header 301, in 
which header 301 includes, inter alia, a time stamp 
and file name identifying file F1. The file name also 
include other information (not shown) identifying, 

55 for example, the so-called pathname associated 
with file F1 . The blocks forming file F1 are stored 
in sequence in memory 30-1, in which each such 
block is preceded by its respective block number. 
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as shown at 302 and 303 for blocks one (1) and 
two (2), 

It is also seen from the FIG. at 304 that com- 
puter 110 has stored In archive memory 30-1 and 
in place of blocks 80 and 81 the messages that 
computer 110 received from computer 10-1, As 
shown at 304 each such message includes its 
associated block number, followed by a so-called 
flag and the identity of block 28. Thus, two rela- 
tively brief messages are stored In the archiving 
memory, rather than the respective blocks them- 
selves. 

Assume that, after a period of time following 
the initial archiving of file F1, computer 10-1 
communicates with computer 110 for the purpose 
of storing In one of the archive memories 30-1 
through 30 -P, e.g., memory 30-1, the latest 
version of file F1. In doing so, computer 10-1 
generates a signature for each block forming the 
latest version of file F1 and stores each such 
signature in sequence In a table formed in the 
internal memory of computer 10- 1. An example of 
the latter table is shown in FIG. 4. 

Following the foregoing, computer 10-1 then 
compares each entry in the newly formed table 400 
with its corresponding entry in previously formed 
table 200. Computer 10-1 does so to determine 
which blocks forming the latest version of file F1 
differ from their corresponding blocks forming the 
initial, or preceding, version of file F1 . In FIG. 4, an 
"X" is used to indicate that a signature in table 400 
is different from the corresponding signature en- 
tered in Table 200 of FIG. 2. That is, the signatures 
at locations 2. 5, 6 and N of Table 400 differ from 
the signatures stored at the corresponding loca- 
tions of Table 200. Table 400 also includes an 
additional signature associated with block (or partial 
block) N + 1 of the latest version of File F1 . 

Once it is armed with the results of the com - 
parison, computer 10-1, then, in accord with an 
aspect of the invention, supplies to computer 110 
for archiving purposes only those blocks forming 
the latest, or current, version of file F1 that differ 
from their corresponding blocks forming the next 
preceding version of file F1. Thus, in the present 
example, computer 10-1 supplies to computer 
110 those blocks of the current version of file F1 
that are associated with the Table 400 signature 
entries of 2, 5, 6, N and N + 1. In addition, com- 
puter 10-1 retains a copy of the contents of Table 
400 so that the table of signatures may be used in 
connection with archiving the next, succeeding 
version of file F1 . 

Turning now to FIG.5, there is shown an ex- 
ample of one way in which Computer 110 may 
store on memory 30-1 the blocks of the latest 
version of file F1 that computer 110 receives from 
computer 10-1. It is similarly seen from FIG. 5 



that the stored blocks of file F1 are preceded in 
memory by a header 501 , in which time stamp 2 is 
associated with the latest version of file F1. Like 
FIG. 3, computer 110 has stored blocks 2, 5, 6, 28, 
5 N and N + 1 in the order that they were received 
from computer 10-1, with each block identified by 
its associated block number. However, in contrast 
to FIG. 3 and in accord with an aspect of the 
invention, only those blocks of the latest version of 

10 file F1 which differ from their corresponding blocks 
forming the previous version are stored in the 
backup memory. Advantageously, then, the two 
versions of file F1, i.e., the initial version identified 
by time stamp 301 (FIG. 3) and the latest version 

15 identified by time stamp 501 (FIG. 5), are stored on 
archive memory 30-1 such that the latter version 
uses significantly less memory space than the 
former version. 

As is well-known, the reason for archiving 

20 different versions of a file is to provide a backup 
copy of the file whenever such a backup is re- 
quired. For example, assume that the current ver - 
sion of file Fl that had been stored in memory disc 
unit 11-1 was lost or destroyed. In such a case, a 

25 user associated with that file may enter via com - 
puter 10-1 a request for a copy of a preceding 
version of file Fl. Computer 10-1, in turn, sends 
to computer 110 a message requesting a copy of 
file Fl, in which the message includes a time 

30 stamp associated with the desired version. As- 
suming that the desired file Fl is associated with 
time stamp 2 identifying the version designated 
500 in FIG. 5, (also referred to herein as version 
501), then computer 110 unloads each block for- 

35 ming that version and supplies the block to com - 
puter 10-1 for storage on disc memory unit 11-1. 

Specifically, computer 110 first identifies in a 
conventional way the starting location at which 
version 501 of file Fl is stored in memory 30-1. 

40 Armed with that information, computer 110 then 
unloads each block in sequence starting with block 
2 and ending with block N + 1 and supplies each 
such block as it is unloaded to computer 10-1 via 
network 20. In doing so, computer 110 tracks in a 

45 so-called "global" bit map stored in scratch pad 
memory internal to computer 110 each block of the 
backup version of file Fl that it supplies to com - 
puter 10-1. For example, when computer 110 
supplies to computer 10-1 block 2 of version 501, 

50 it sets that bit in the bit map having a bit position 
corresponding with the number 2, i.e., the second 
bit position. As a further example, when computer 
110 supplies the next block 5 of version 501, it 
then sets the fifth bit position in the bit map. 

55 (Herein, the term "set a bit in the bit map" means 
to set the pertinent bit to a particular logical value, 
e.g., a binary one.) Accordingly, once computer 
110 has supplied to computer 10-1 the blocks 
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forming stored version 501, then the bits located at 
bit positions 2.5, 28, N and N 1 in the global bit 
map would be set to a logical one. (An example of 
such a bit map is shown in FIG. 6, in which the 
aforementioned bit locations in map 700 are set as 
represented by the respective dots.) The purpose 
for maintaining the global bit map will be made 
apparent below. 

Following the foregoing, computer 110 then 
determines, in a conventional manner, the starting 
location at which a next preceding version of file 
F1, if any, is stored in memory 30-1. For the 
present example, that version would be the version 
designated 300 in FIG. 3 (also refen^ed to herein as 
version 301). Accordingly, computer 110 unloads 
from archive memory 30-1, the first block number 
stored therein and checks the value of the bit at the 
corresponding bit position in the global bit map. In 
the present example, the bit at position one of the 
global bit map would not be set, thereby indicating 
that computer 110 has not yet supplied block 1 of 
file F1 to computer 10-1. Accordingly, computer 
110 unloads from the archive memory 30-1 block 

1 of version 301 and supplies the block to com - 
puter 10-1 via network 20. In addition, computer 
110 sets the bit at bit position one in the global bit 
map to indicate that block 1 has been supplied to 
computer 10-1. Computer 110 then unloads from 
stored version 301 the next block number, namely 
block number 2. 

For block number 2, computer 110 would find 
that the bit at bit position two in the global bit map 
Is set to a logical one, thereby indicating that block 

2 (namely block 2 of later version 501) has been 
supplied to computer 10-1. In this instance, then, 
computer 110 would not unload from the archive 
memory 30-1 the associated block 2, but would 
go on to unload the next block number, i.e., block 
numt)er 3. For block numbers 3 through 27 com - 
puter 110 would find that the bits located at the 
respective corresponding positions in the global bit 
map would not be set. Therefore, computer 110 
unloads the associated blocks in sequence from 
the archive memory 30-1 and supplies them to 
computer 10-1 as they are unloaded. Similarly, 
computer 110 sets the bits located at the cor- 
responding bit positions in the global bit map. 

Computer 110 then unloads block number 28 
from stored version 301. However, in doing so 
computer 110 would find that the bit at position 28 
in the global bit map is set to a logical 1, thereby 
Indicating that block 28, that is, block 28 of version 
501 (FIG, 5), has been supplied to computer 10-1. 
Accordingly, computer 110 would not unload from 
stored version 301 block 28. 

In a similarly manner, computer 110 unloads 
blocks 29 through 79 of stored version 301 and 
supplies them to computer 10-1. 



Computer 110 similarly maintains in its scratch 
memory another bit map (local bit map) associated 
with the stored version of a file that computer 110 
is currently unloading. For example, for each block 
5 that computer 110 unloads from version 501 (FIG. 
5). it sets the corresponding bit in the global bit 
map and in the local bit map associated with ver - 
sion 501. Computer 110 does so to track by block 
number the blocks that have been unloaded from a 

10 particular archived version of a file. The underlying 
reason for maintaining a local bit map will be made 
apparent below. 

Thus, as a result of unloading the blocks 
identified by the numbers 1, 3, 4, 7-27 and 29- 

75 79 of stored version 301 and supplying them to 
computer 10-1, the bits located at bit positions 
corresponding with those numbers in the asso- 
ciated local bit map would also be set. (An exam - 
pie of a local bit map would be somewhat similar to 

20 global bit map 700 depicted in FIG. 6.) Following 
the foregoing, computer 110 then goes on to un- 
load the block 80 message from stored version 
301, which, as mentioned above, indicates that 
block 80 is identical to block 28. If computer 110 

25 consulted the global map it would find that the bit 
at position 28 is set to a logical one. (As mentioned 
above, the latter bit position had been set as a 
result of computer 110 unloading and supplying to 
computer 10-1 block 28 of version 501 .) However, 

30 the block 80 message actually means that block 80 
is identical to block 28 of version 301 and not 
version 501. Accordingly, computer 110 is ar- 
ranged so that when it encounters a stored block 
message, such as the block 80 message, it con - 

35 suits the associated local bit map, rather then the 
global bit map. Computer 110 does so to ac- 
curately determine whether a particular block has 
been supplied to the requesting computer 10. 
Thus, the flag associated with the block 80 mes - 

40 sage causes computer 110 to consult the local bit 
map associated with version 301 to determine if 
block 28 of that version has been supplied to 
computer 10-1. Since the local bit map indicates 
that computer 110 did not supply that block, then, 

45 computer 110 unloads the block 28 of version 301 
and supplies that block to computer 10-1 as block 
80. In doing so, computer 110 sets the bit located 
at bit position 80 in both the global bit map and the 
associated local bit map. In addition, computer 110 

50 notes in its scratch pad memory that block 80 is a 
duplicate of block 28. Computer 110 then goes on 
to unload the block 81 message. 

In response to the block 81 message computer 
110 would consult the local bit map associated with 

55 version 301 to determine if block 28 has been 
supplied to computer 10-1. As a result thereof, 
computer 110 would conclude that block 28 of 
version 301 has not been supplied to computer 
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10- 1. However, at that point, computer 110 would 
consult the note It stored in its scratch pad to 
determine if block 28 of version 301 had been 
supplied as a duplicate of another block of version 
301. Computer 110 would thus find stored in Its 
scratch pad memory the notation indicating that 
block 28 had been supplied to computer 10-1 as 
a duplicate of block 80. Accordingly, computer 110 
changes the block 81 message to indicate that 
block 81 is a duplicate of block 80 and supplies the 
changed block 81 message to computer 10-1. 
Computer 10-1 responsive to receipt of the mes - 
sage then creates a copy of priorly received block 
80 and stores that copy In its associated disc 
memory unit 11-1 as block 81 . 

Following the foregoing, computer 110 contin- 
ues supplying the backup versions in the described 
manner until the last block thereof is supplied to 
computer 10-1. In the present example, the last 
block would be block N + 1 of version 301. It is 
noted that if a version of F1 had been archived 
prior to version 301, then computer 110 would go 
on to process that prior version in the described 
manner in order to restore file F1 in disc unit 
memory 11-1. 

As an aspect of the invention, computers 10-1 
through 10 -N are also arranged such that, when- 
ever they supply the total contents of their asso- 
ciated memories 1 1 to computer 1 1 0 for storage on 
one of the archive (backup) memories 30, they 
generate a table of respective block signatures 
across such contents. (Such signatures would be 
stored in a respective table in the form shown in 
FIGs. 2 and 4.) Thereafter, if a fault caused a 
particular disc memory unit, e.g., memory 1 1 - 1, to 
become inoperable, and, therefore, had to be re- 
placed, then the replacement disc unit could be 
loaded with a backup copy of the latest archived 
version as a way of restoring the contents of that 
disc unit as it existed at a prior point in time T. 

More particularly, if a replaced disc unit mem - 
ory was one of a group of such disc units, e.g.. 

11 - J through 11 -M associated with computer 
10 -N, then the backup copy that is stored on the 
replacement disc unit would not be current. 
Whereas as the contents of the other disc memory 
units would be current. The contents of the re- 
placement disc unit, therefore, might not possibly 
agree in time with the contents of the other disc 
memory units of the group. One approach to this 
problem is to restore the contents of the other 
discs to the same point in time T. However, if the 
capacity of such disc units is very large, e.g., on 
the order of a gigabit, then the restoration process 
would consume an inordinate amount of time. 

The computers 10-1 through 10 -N are ar- 
ranged to take a different approach, one which is 
significantly faster than the suggested approach. 



For example, assume that the group comprises two 
disc memory units 11 - J and 11 -M in which disc 
memory 1 1 - M is replaced and the contents of 
memory 1 1 - M is fully restored to the way it ex - 

5 isted at the prior point in time T. To obtain coher- 
ency between the respective contents of memories 
11 -J and 11 -M, computer 10 -N may then, in 
accord with an aspect of the invention, restore the 
contents of memory 1 1 - J as it exited at time T 

10 without requesting a full restoration of that disc 
unit. Specifically, computer 10 -N, in accord with 
an aspect of the invention, generates for each 
block forming the contents of memory 11 - J re- 
spective signatures, thereby establishing a current 

75 table of signatures. Computer 10 -N then com- 
pares each signature in the current table with a 
corresponding signature contained in a prior table 
of signatures that was created at time T over the 
contents of memory 1 1 - J. For the present the 

20 example, assume that table 400 shown in FIG. 4 is 
the current table and that table 200 shown in FIG. 2 
is the prior table. With that assumption in mind, 
then, computer 10- N would note that since time 
T, blocks 2, 5, 6, 28, N and N + 1 had changed. 

25 Accordingly, to restore the contents of memory 
1 1 - J to the point as it existed at time T, then 
computer 10-1 supplies to computer 110 a re- 
quest for copies of the blocks identified by the 
aforementioned numbers, in which the request 

30 would include time stamp T. Assuming that time 
stamp T identifies archived version 301 (FIG. 3), 
then, computer 110 unloads from archive memory 
30-1 the requested blocks of version 301 and 
supplies them, in turn, to computer 10 -N. Com- 

35 puter 10 -N responsive to receipt of each re- 
quested block stores the block in memory 1 1 - J in 
place of its later version. Computer 10 -N thus 
achieves coherency in the described manner 
without resorting to restoring the total contents of 

40 memory 11 -J. 

FIGs. 7 and 8 illustrate in flow chart form a 
program which implements the principles of the 
invention in client computers 10-1 through 10- N. 
Similarly, FIGs. 8-11 illustrate in flow chart form a 

45 . program which implements the principles of the 
invention in archive computer 110. In view of the 
fact that FIGs. 7-11 are self-explanatory, espe- 
cially when viewed in conjunction with the fore- 
going detailed description, and in the interest of 

50 conciseness, no further explanation thereof is pro - 
vided herein. 

The foregoing is merely illustrative of the 
principles of the invention. Those skilled in the art 
will be able to devise numerous arrangements 

65 falling within the scope of the invention. For ex- 
ample, the task of generating and maintaining sig - 
natures and signature tables could be implemented 
in archive computer 110. In such an instance, then, 
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a computer, e.g., computer 10-1, would pass an 
initial or later version of a file to computer 110 for 
archiving purposes (In which the term file Includes 
the total contents of a disc memory). Computer 
110 would then generate the signatures for the 
blocks of the received file, and then store in one of 
the archive memories only those blocks of the 
received file having signatures which differ from 
their corresponding next preceding signatures. In 
addition, although memories 11-1 through 11 -P 
were defined herein as being disc memories, it is 
apparent that such memories could be another 
type of memory. For example, any type of mag - 
netic or optical memory media. 

Claims 

1. Apparatus for storing a file in a memory com - 
prising 

means for dividing said file into respective 
blocks and generating for each of said blocks 
a respective signature indicative of the values 
forming the contents of the associated block, 
and 

means for storing in said memory ones of 
said blocks of said file having respective sig - 
natures different from signatures generated for 
corresponding ones of said blocks of a prior 
version of said file. 

2. The apparatus set forth in claim 1 wherein said 
means for storing includes means for causing 
all of said blocks to be stored in said memory 
if said prior signatures do not exist. 

3. The apparatus set forth in claim 1 wherein the 
blocks of said file and blocks of said prior 
version of said file are associated with re- 
spective time stamps. 

4. The apparatus set forth in claim 1 further 
comprising means for storing said signatures 
in a table identified by a time stamp asso- 
ciated with said file, said signatures being 
stored in said table in the order that they are 
generated. 

5. The apparatus set forth in claim 1 wherein said 
means for storing includes means operative in 
the event that the signature generated for one 
of said blocks equals the signature generated 
for another one of said blocks and for then 
storing in said memory in place of said one 
block a message indicating that the contents of 
said one block equals the contents of said 
other block. 



6. The apparatus set forth in claim 5 wherein the 
blocks of said file and blocks of said prior 
version of said file are associated with re- 
spective time stamps. 

5 

7. The apparatus set forth in claim 6 further 
comprising 

computer means associated with said 
memory, said computer means comprising 

10 means responsive to receipt from an 

originator of a request requesting a restoration 
of said file for unloading from said memory the 
blocks of said file representing earlier versions 
of said file that may be stored in said memory 

75 and supplying them to said originator such that 
said unloading is based on the reverse order 
that said versions were stored in said memory. 

8. The apparatus set forth in claim 7 wherein 
20 groups of stored block are associated with 

respective versions of said file, in which the 
blocks forming respective ones of said groups 
are identified by respective block numbers, 
and wherein said means for unloading includes 

25 a global bit map comprising a plurality of 

bit locations associated with respective ones of 
said block numbers, 

means, responsive to a first one of said 
blocks being unloaded from said memory for 

30 setting in said bit map the bit whose location 
corresponds with the block number of said first 
block, and 

means, operative prior to the unloading of 
a second one of said stored blocks of a re - 
35 spective one of said groups, for preventing the 
unloading of said second block if its associated 
bit in said bit map had been set as a result of 
the unloading of a corresponding block of an - 
other one of said groups. 
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